Joint Variable Selection and Classification with Immunohistochemical Data
نویسندگان
چکیده
To determine if candidate cancer biomarkers have utility in a clinical setting, validation using immunohistochemical methods is typically done. Most analyses of such data have not incorporated the multivariate nature of the staining profiles. In this article, we consider modelling such data using recently developed ideas from the machine learning community. In particular, we consider the joint goals of feature selection and classification. We develop estimation procedures for the analysis of immunohistochemical profiles using the least absolute selection and shrinkage operator. These lead to novel and flexible models and algorithms for the analysis of compositional data. The techniques are illustrated using data from a cancer biomarker study.
منابع مشابه
Finding stability regions for preserving efficiency classification of variable returns to scale technology in data envelopment analysis
This paper addresses issue of sensitivity of efficiency classification of variable returns to scale (VRS) technology for enhancing the credibility of data envelopment analysis (DEA) results in practical applications when an additional decision making unit (DMU) needs to be added to the set being considered. It also develops a structured approach to assisting practitioners in making an appropria...
متن کاملNegative Selection Based Data Classification with Flexible Boundaries
One of the most important artificial immune algorithms is negative selection algorithm, which is an anomaly detection and pattern recognition technique; however, recent research has shown the successful application of this algorithm in data classification. Most of the negative selection methods consider deterministic boundaries to distinguish between self and non-self-spaces. In this paper, two...
متن کاملA Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملMedial Collateral Ligament Injury; A New Classification Based on MRI and Clinical Findings. A Guide for Patient Selection and Early Surgical Intervention
Medial collateral ligament (MCL)injury, is one of the most common ligament injuries of the knee,mostly results from a valgus force.Restoration of function and going back to the pre-injury level of function is the aim of treatment in ligament injuries of the knee. There are multiple soft tissue structures in medial side that play an important role in connection with each other to retain medial s...
متن کاملOptimum simultaneous discretization with data grid models in supervised classification: a Bayesian model selection approach
In the domain of data preparation for supervised classification, filter methods for variable ranking are time efficient. However, their intrinsic univariate limitation prevents them from detecting redundancies or constructive interactions between variables. This paper introduces a new method to automatically, rapidly and reliably extract the classificatory information of a pair of input variabl...
متن کامل